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Recently, number of data intensive workflow have been generated with 
growth of internet of things (IoT’s) technologies. Heterogeneous cloud 
framework has been emphasized by existing methodologies for executing 
these data-intensive workflows. Efficient resource scheduling plays a very 
important role provisioning workload execution on Heterogeneous cloud 
framework. Building tradeoff model in meeting energy constraint and 
workload task deadline requirement is challenging. Recently, number of 
multi-objective-based workload scheduling aimed at minimizing power 
budget and meeting task deadline constraint. However, these models induce 
significant overhead when demand and number of processing core increases. 
For addressing research problem here, the workload is modelled by 
considering each sub-task require dynamic memory, cache, accessible slots, 
execution time, and I/O access requirement. Thus, for utilizing resource 
more efficiently better cache resource management is needed. Here efficient 


resource utilization (ERU) model is presented. The ERU model is designed 
to utilize cache resource more efficiently and reduce last level cache failure 
and meeting workload task deadline prerequisite. The ERU model is very 
efficient when compared with standard resource management methodology 
in terms of reducing execution time, power consumption, and energy 
consumption for execution scientific workloads on heterogeneous cloud 
platform. 
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1. INTRODUCTION 

Unanimous growth and advancement in fabrication industry, number of transistors can be easily 
embedded into one single chip, also recent development in processor design has caused for adding several 
central processing unit (CPU) core and large cache in single chip to enhance the performance. However, this 
has also caused several power mechanism issue which affects the device reliability, device performance and 
battery life of device. Moreover, power consumption is considered as one of the major reason that lead to the 
architecture shift towards the multicore chips, this tends to manage the demand in increase in frequency. 
However, in order to keep up the performance flawless many number of cores needed to be added to the 
given processor, this makes the power one of the important factor again; the main challenge is managing 
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huge number of cores for delivering high performance with low power consumption. The performance can be 
improvised through two scenarios i.e. either through increasing the core and communication width or through 
increasing frequency. These two scenarios can result in power consumption; further energy is considered as 
the byproduct of performance and power. Hence it is another motivation for researching the relationship 
among them. 

Power consumption issue has been tackled through the various existing technique such as dynamic 
power management (DPM) and dynamic voltage frequency scaling (DVFS) [1], in here DVFS controller 
detects the computational patterns in execution process and further determines the voltage scaling and 
frequency scaling of CPU core to reduce the energy consumption. Moreover, several processor designs have 
adopted DVFS per core. For instance, Intel’s boost technology and LITTLE cores, where each core holds the 
capability for scaling the level of voltage and frequency; further in multi-core model, extended control 
protocol is implied for controlling the core to reduce power consumption [2]. Further other studies have 
focused energy efficiency indirectly through performance improvisation as in [3], [4], machine learning 
based approach is used for the complex problem such of minimizing the multicore power consumption [5]. 
These mechanisms can be applied efficiently to the core; however, such strategy cannot be scaled to the 
given chip level where management is required to be applied in given coordinated type. Moreover, the 
overhead of given technique are unacceptable as the number of core increases [6]. Further, reinforcement 
learning is employed to manage the power consumption due to prior knowledge and also it is adaptive [7]. 
However, all theses DVFS suffers heavily from the low supply voltage constraint and failed to consider the 
cache reliability and also none of them considered the power allocation among chip clusters dynamically in 
order to improve the performance or to save power, simultaneously maintaining the scalability of model. 

CPU utilization control using feedback information [8] has shown remarkable performance to 
provide the real time guarantees through workload variations adaption based on given dynamic feedback. 
Moreover, the main aim of utilization control is enforcing the proper utilization of schedulable bounds in real 
time scenario on entire processors. This needs to be achieved despite of uncertainty in workload. Hence it is 
observed that utilization control is capable of meeting all the deadlines of real-time without proper 
knowledge of workload such as execution times of task. Further power aware utilization is focused by few 
researchers to achieve the reduction in power consumption and real time guarantees [9]-[11], however the 
existing work on this mainly depends on the DVFS through assumption that execution time of task can be 
easily adapted along with CPU frequency. More over the assumption is very much valid considering the real 
time scenario for task that are memory intensive and computation intensive and have 75% of instructions as 
store or load. Furthermore, when particular processor is in process CPU frequency and memory intensive 
tasks are set to highest level, further utilization can also exceed the schedulable bound this results in missing 
undesired deadlines. Moreover, cache size is divided to core and can be maximized to minimize the cache 
access latency and cache miss rate which is occurred due to the less memory access delay. Hence CPU 
utilization is lowered for efficient real time scenario, further if utilization is comparatively lower than given 
bound while the frequency is at the lowest level, size of active cache can be minimized further and other 
cache units which are rarely used can be used through putting low power mode to reduce the cache leakage 
power [12]. 

In this paper, two-phase utilization was proposed for energy efficiency in the real time scenario for 
heterogeneous multi-core processing environment, at core level this mechanism utilizes dynamic L2 cache 
partition and per-core DVFS for addressing the objectives such as reducing the core energy consumption and 
controlling the CPU utilization for each core. Moreover, utilization occurred due to the periodic real time can 
be determined through frequency independent and frequency dependent execution time. Further cache 
partitioning and per-core DVFS is used for adopting the dependent and independent portions of frequency, 
respectively [13], [14]. However, a main challenge here is traditional control theory optimization model [15] 
does not handle the optimization objectives. Further, the decision making process to optimize 
voltage-frequency scaling and last level cache controller are made separately, leading to increased quality of 
service (QoS) violation when last level cache controller aims to reduce global last level caches. Existing 
cache optimized based workload scheduling achieves very poor energy efficiency as they don’t consider 
optimizing voltage-frequency scaling in dynamic manner. For example, using less cache resource for 
execution of workload can lead to increased scaling of voltage-frequency; thus higher energy for task 
execution is incurred due to quadratic effects. Hence in this paper an efficient resource management 
technique is developed which was based on the multi-objective control theory [16], [17] to optimize the 
above two objectives (i.e., utilize cache resource more efficiently with minimal execution time for scheduling 
scientific workload under heterogeneous cloud computational framework). The efficient resource utilization 
(ERU) methodology are modelled by employing efficient cache resource usage by optimizing voltage- 
frequency in dynamic manner and last level cache partitioning in heterogeneous cloud framework. The ERU 
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model can coordinate the cache size from the given core and further the dynamic cache is resized to reduce 

the leakage in power consumption of last level caches (LLCs). 

The significance of efficient resource utilization model: 

— Here efficient resource utilization for execution of data-intensive application for heterogeneous cloud 
framework is presented. 

—  ERU model reduces LLC failures with better cache and V/F scaling optimization in dynamic manner; 
thus utilizes resource more efficiently. 

— The ERU methodology achieves very good outcomes when compared with standard resource utilization 
methodology considering performance metrics such as power consumption, energy efficiency, and 
execution time. 

The paper organization is as follows: in section 2, survey for underlying benefit and limitation of 
using state-of-art workload scheduling is described. In section 3, the proposed efficient resource utilization 
working model is described. In section 4, experiment outcome obtained by ERU over various existing 
workload scheduling model is detailed. Lastly, the research is concluded and future direction wok are 
discussed. 


2. LITERATURE SURVEY 

This section presents analysis of various existing workload scheduling algorithm under cloud 
computing environment. In [18]-[20] observed that energy is one of the major concern while designing the 
multicore chips, here performance and power are two primary energy components which are inversely related 
to each other, in here multicore chips optimization which process on the parallel load using either 
performance optimization or power optimization. Hence to achieve that machine learning model was 
developed based on the dynamic and global controller of power management, moreover controller is used for 
reducing the power consumption and increases the performance in given power budget. Further it is observed 
that controller is scalable and does not possess much overhead as there is increase in demand. In [21] 
observed that existing methods tries to switch off several processors through combining the task on fewer 
processors for reducing the energy consumption for deadline constrained. However, it is observed that 
turning of processor might not be necessary to reduce the energy consumption, hence they proposed energy- 
aware processor merging (EPM) mechanism to choose the particular processor to switch off for energy 
consumption and quick-EPM was developed to minimize the computational overheads. In [22] proposes cost 
and energy aware scheduling (CEAS) technique for the cloud scheduler to reduce the workflow execution 
cost and minimize the energy consumption which meets the deadline prerequisite. In general, CEAS 
comprises five algorithms, at first virtual machine (VM) selection approach is used that applies the cost 
utility concept to map the task to their optimal VM-types through the make span constraint. Later two tasks 
aware technique were employed to minimize the energy consumption and execution cost, further to reuse the 
VM Instance, VM reuse policy is developed and at last slack time reclamation gets utilized to reduce the 
energy of these VM Instances. 

In [23] observed that any increase in chip temperature possesses various circuit errors, also there is 
large increase in leakage power consumption. Hence it applied task migration or traditional digital terrain 
model (DTM) technique for reducing the core temperature as these cores possess high temperature, also to 
compensate with high demand in data, last level cache (LLC) is attached which helps in reducing on leakage 
of power by occupying the chip area. Further to reduce the power consumption cache size are made to shrink 
dynamically, shrinking of cache size not only helps in leakage of power consumption but also helps in 
creating on chip thermal buffers to further minimize the temperature of chip though exploiting heat transfer. 
Moreover, resizing of cache are carried out based on cache hotspot generated while execution. In [24] 
proposed lead i.e. learning enabled-EAD (energy aware dynamic voltage) scaling for given multicore 
architecture using reinforcement learning and supervised learning. Further lead groups the link and its router 
into the same voltage-frequency domain and further implements the management strategies of proactive 
DVFS which mainly rely on the machine learning based offline model to provide the voltage-frequency 
selection among the pairs of voltage/frequency. Further three supervised learning model were developed 
based on energy/throughput change, buffer utilization change and buffer utilization, these allows the 
proactive mode selection mode on the basis of absolute prediction. Further reinforcement learning models 
were developed which optimizes selection of DVFS mode directly and also removes the requirement for 
threshold and label engineering. 

In [25] showed heterogeneous multi-core processing is adopted mainly in embedded system, as it 
provides the energy consumption minimization through applying the popular technique like DPM and DVFS. 
Moreover, effective management of energy based technique exploits the software and hardware level energy 
minimization technique. energy efficiency partitioning (EEP) is a software level technique where task 
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allocation to the given heterogeneous clusters impacts the whole energy model. Hence a technique was 
developed which couples the energy efficient partition problem along with task scheduling as task differ in 
terms of system on a chip (SoC) circuitry, active processing, execution path, I/O access, memory, cache and 
instruction mix, these affects the demand in power. Moreover, hardware frequency scaling is used for scaling 
to minimize the model energy. In [17], [26] proposed two resourceful workflow scheduling technique which 
considers the monetary cost as well as make span. Hence at first single objective workflow is developed 
named deadline-constrained cost optimization for hybrid clouds (DCOH) which was mainly deadline 
constrained cost optimized to minimize the scheduling cost under the given deadline. Furthermore, 
considering DCOH, multi-objective optimization is proposed for hybrid cloud named MOH to optimize the 
monetary cost and execution time of workflows. 

In [18] observed that single algorithm does not possess optimal solution under different power 
settings, dynamic slack and various workloads, further the device configuration variation affects the DVFS 
algorithm. Hence considering the adaptability this paper focused on developing the reinforcement learning 
that takes execution technique set which are specialized to handle the various conditions and switches to the 
best technique considering the situations. In [19] developed load balancing approach for allocating the 
non-real time on the given heterogeneous nodes, further they introduced the processing node frequency of the 
whole cycle for all the jobs that are assigned. In [20] developed an integer linear programming (ILP) based 
thread which takes the input through hardware performance which determines the characteristics of thread. 
Here they used the last level cache and instructions per second as measure of memory bandwidth and CPU 
load. Moreover, they used performance metric for optimizing the global thread to the core assignment. 
However, it is not suitable for the real time scenario and it requires to solve the problem of ILP periodically 
and hence this incurs the marginally scheduling overhead. 

From extensive survey it is seen the existing workload scheduling adopting multi-objective 
optimization induce computation complexity because of NP-hard problem. Further, DVFS_ based 
methodologies are highly influenced by fluctuation of clock frequency and when supply voltage is low these 
model induce significant degradation in performance. Further, focusing only on workload execution 
makespan metric will result in improper calculation of energy dissipation. This is because different task will 
have diverse execution paths, intrinsic cache usage, I/O access pattern. Further, it is noticed that power will 
not always be same even if workload is executed on same kind of processing element. This is because the 
jobs with higher cache and memory accessibility would incur higher energy. Thus, for overcoming research 
problem this work presents an efficient resource utilization model for executing scientific workload on 
heterogeneous computational environment in next section. 


3. AN EFFICIENT RESOURCE UTILIZATION TECHNIQUE FOR SCHEDULING SCIENTIFIC 
WORKLOAD IN CLOUD COMPUTING ENVIRONMENT 

This section present efficient resource utilization (ERU) technique for scheduling scientific 
workload on heterogeneous cloud framework. Here the cache resource is utilized more efficiently in order to 
improve cloud resource utilization outcomes. The ERU is designed in such a way that it minimizes LLC 
failures under shared caching environment. Here VM migration is done for meeting cache constraint in 
reducing LLC failures. The ERU methodology for executing workload in heterogeneous cloud framework is 
done through two phased as shown in Algorithm 1. First, for enhancing the bandwidth usage and system 
capacity, the virtual computing node shares its cache memory. Second, reconfiguration of virtual computing 
machines is done whenever last level cache failures occurs. 


Algorithm 1: Efficient resource utilization (ERU) technique for executing data-intensive workflow in 


heterogeneous computing framework. 
Step 1. Start 

Step 2. Compute and establish Ny, 
Step 3. Compute and establish W, 


Phase 1- 

Step 4. For each processing node j from 1 to y do 
Step 5. nx;-collects (j) 

Step 6. Wy-sort (nx; ) 

Step 7. Obtain (W, 

Step 8. End for 


Phase 2-Obtain processing node with maximal and minimal last level cache failures 
Step 9. Maximal Node-find Maximal Node (Ny, 

Step 10. Minimal Node-find Minimal Node (N, ) 

Step 11. Maximal VCM-find maximal VCM (Maximal Node) 

Step 12. Minimal VCM-find minimal VCM (Minimal Node) 

Step 13. if T<maximal Node;zrc-minimal Nodetrc then 
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Step 14. Interchange (maximal VCM, mimimal VCM) 
Step 15. Stop. 


The ERU methodology is designed to utilize cache resource more efficiently meeting energy 
constraint of heterogeneous platform [21]. Thus, the energy incurred P(t) in heterogeneous platform for 
respective (i.e., t™) interval of time is computed as follows 


P(t) = e(t)Ma (1) 


where, e(t) depicts the overall power induced considering frequency level of processing core By with L2 
cache size which is constant in nature with respect to respective time sessions M, for executing workload. 
Here Ma, represents the operating period for discharging different information of entire workload-task in 
process of tm session. Here the cache partitioning size with energy constraint are configured and frequency 
prerequisite for executing under such constraint is obtained as follows 


min Yi-ilVe -— vx (t)]? (2) 


a(t)|1 sksi,f,(t)|1sksi 


P(t) (3) 


a;(t)|1 ckatty()|1sksi 
where v;(t) represents core utilization B, in t‘” session instance, V, depicts resource utilization sets 
V = [Vj,......,Vj]? for respective frequency range of [Rox Ri x| for each processing core By, 
{a,(t)|1 <k <i} depicts cache memory partition size and {f,(t)|1 < k <i} depicts processing core 
operating frequency at t*” session instance for decreasing variance amid core utilization v(t) and utilization 
sets (V;,). 


The processing element of cloud computing framework is composed of two cache elements namely 
L1 cache and L2 cache. These caches are shared among different core in multi-core shared computational 
framework. Here each processing element has DVFS capability. Thus, aid in saving significant amount of 
energy resource. The cache memory is portioned for carrying out various task. Therefore, the L2 cache 
partition size is represented by a,(t) considering processing core size B,. The peak frequency size under 
certain B, is represented by ff, (¢). The (2) and (3) must satisfy following constraint 


Rix S f(t) S Rp, where,(1<k <i) (4) 


Di=14x(t) <A (5) 


where A depicts total L2 cache size available in heterogeneous cloud computing environment. 

The (2) depict the minimum energy dissipation for executing certain task under heterogeneous cloud 
computational framework under certain power generation e(t) for t®” session instance. The (3) depicts the 
processing machine frequency leis within in range of each processing core using ERU model. The frequency 
range variation depends on kind of processing element being used. The (5) depicts summation of every 
partitioned cache memory which is almost equal to total memory available. 

For each processing core, the variation among resource (i.e., core) utilization v,(t) and utilization 
sets V; is reduced utilizing cache aware resource utilization method by modifying the cache partition size and 
its core frequencies. However, optimizing frequencies based on different cache partition size in static manner 
induces overhead and affect the processing time of heterogeneous computational environment. Thus, for 
improving processing time a dynamic optimization model is presented. The model maintain ideal relationship 
among balancing v;(t), core frequency f(t), and optimizing feature a,(t) in t“” session instance. First, for 
respective core B,, the dynamic optimization model gives an ideal relationship among Dep (Ct), job operational 
time M,, and optimizing feature f,(¢) in t" session instance and a;,(t). Then, the relationship parameter 
byp(t) can be optimized in different manner such as frequency independent or frequency dependent as 
described in the (6) 


Dip (t) = Sp (t) + inp: (f(t) ) 


where Syp(t) depicts frequency, independent segment considering respective operational session instance 


Mxp as processing time of I/O devices does not rely upon frequency of individual core and txp. (fx (t)) 
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depicts frequency dependent segment because it depends on frequency of operating cores. The reserved 
cache memory for respective job operational instance M;,,, considering certain I/O device does not take part 
for executing jobs can be depicted as sx,(t). The parameter s,,(t) plays an ideal relationship among cache 
failure and cache memory size. The ideal relationship among Sxp(t), @xp(t), and allocated caches for 
heterogeneous computational framework B, can be estimated using following (7): 


7 Diep Qkp(t) + Hep 0 < Agp(t) < Xep 
Spl). = Constant Ayp(t) => X 
kp = “kp 
where Dy, Hyp are quantified jobs features, and X;, depicts the operational set size within job operational 
session instance My,y. The (7) shows that whenever operation set size Xzp is higher than agp(t), the cache 
memory size improves and aiding in minimizing operation session instance. In similar manner, if operation 
set size Xj, is lower than a,,(t), then cache failure will be higher and can’t be addressed by allocating 
additional cache memory. Thus, for managing job execution of real-time scientific application, the 
relationship among total independent frequency and operation session instance of each job in heterogeneous 
computing processing element B, and total cache size a,(t) given to processing element B, is established 
using following (8): 


s (t) = dp Dp’ Ay (t) + Xe Ap 0 < a, (t) < Xx (8) 
‘ Constant a,(t) > Xx 
where Dip’ = SCO and Xx = Dip Xxp - The (8) depicts cumulating of (7) for every job on heterogeneous 


computational processing element B,. Then, the proposed ERU model aids in minimizing interference among 
different processing element shared caches can be described using following (9): 


hy (t) i= De lxp Ukp (fy (t)) + ee Dap’ Up % (t) + Da Axp kp (9) 


where h,(t) depicts the estimated processing element resource utilization and q,, depicts job rate within 
operational session instance M,,,, for heterogeneous computing environment B,. Using (9), it can be shown 
that h,(t) is proportionally inverse with respect to processing element frequency f(t). The estimated 


variation in resource utilization Ah,(t) for heterogeneous computing framework B, is described using 
following (10) 


Ah, (t) = I, (t) De lxp Ukp + Aa, (t) Di Dep’ Vp (10) 


where Ah, (t) is a linear function with respect to [,(t) and Aa,(t), I,(t) = (=5)- (=) and 
k R(t 


Aa, (t) = a,(t) — a,(t — 1). The (10) substitute direct frequency utilization of processing element f;,(t) 
to I, (t). The (10) verifies that Ah; (t) proportional with respect to i,, and Digg Therefore, the cost function 
of heterogeneous computational environment can be minimized using regulator for heterogeneous processing 
element B;, using following (11) to (13): 


Ze(t) = DEallue(e + —1|t) — Pt,(e+¢-1)k) |]? + llueGlt)- ue G@-11H1? ap 
Rix S f,() < Rye (12) 


aK (t) Ss Agquota,k (13) 


where Pf,(t + 1]|t) depicts the pattern considering resource utilization influence/feature v,(t +c — 1|t) 
L(t) 
Aax (t) 


for estimating the pattern of the device in E operational session instances. The cache size a,(t) for 
heterogeneous computational framework B; is bounded by Qguota,x for satisfying (5). Thus, using dynamic 
model, the least square problem can be minimized, and cache memory can be optimized in efficient manner. 
The power consumption optimization can be described using efficient resource utilization model can be 
described using following (14) to (16): 


must change its present utilization influence v, (t) to Vz, u,(t) = | and E depicts the computed range 
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x(t) = Sfp (t)? + Yeag(t) + Cy (14) 
Ruy S f(t) S Ree (15) 
a, (t) Ss Aquota,k (16) 


where S;,Y,, and C, depicts the power factors of the heterogeneous computation framework processing 
element of virtual computing nodes. The power consumption of heterogeneous computational framework 
processing element can be described as cumulative of power consumed by different shared caches and 
processing element. The total power consumption are dependent on leakage power C, and dynamic power 
component S;f;,(t)?. Thus, the cache memory power consumption can be optimized using the ERU model. 
The benefit of reducing cache resource usage cost plays very important part in workload scheduling. 
Traditionally, the caching cost is measured in terms size of data used (i.e., high usage means higher cost). 
However, considering workload deadline prerequisite the novelty of this work is the cache benefits is 
measured in terms of response time. Therefore the caching benefits D, is measure as shown in (17): 


0 Q, =0 
D.= 


a 
@ Og * (User +m) Q, #0 


(17) 


where BWeache represent caching I/O bandwidth, J, represents the task data size, Useex signify time desired 
for enlisting data in cache partition, and Q, represents data re-accessibility from the cache. Using [21] the 
caching cost benefits are Max-Min for addressing data comparability issues 


Dren = eed (18) 


where D, describes minimal result of cache cost benefit and D; represents maximal result of caching cost 
benefit. Further, replacing data from cache may induce certain cost. For calculating replacing cost this this 
work takes the number of unused (i.e., garbage) partitions of a data as the measurement. If the particular data 
blocks are in active/hot mode, these data have less probabilities of being replaced. Thus, they exhibit less 
replacing cost. The access probabilities of each data block considering window sampling x can be defined 
using following (19) 


__ access» 

Qe ~ access (19) 
where Q» defines access probabilities of data blocks £, access» depicts number of time the data blocks ¢ 
being accessed, access depicts total number times access within time period x. Let us partion the session 
window into smaller sessions *1,% ,...,X,. Then, data blocks # access probabilities in different session 
segments are established: 


Qe, = access, /access, 


Qe, = access, /ACcess, (20) 


Qe, = accesse, /accessy 


In which, Qe depicts the probabilities of data blocks in session %;, accesse, depicts number of times the data 
blocks ? is being accessed in session x;, access, depicts total number of times all the data blocks is being 
accessed in session x;. The active mode data blocks ? can be established using following (12): 


Qe, _ Qe3 . Qe4 Qe. 


active, = 5 20 sald 
£. 


igok = (21) 


Qe, Qe, Veg Qe,-4 ey 


Thus, replacing cost of data can be described using following (22) 
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Ry = Ty (22) 


activep*Tp 


where & depicts partition size of data blocks for particular task, Jp standard data block size, active, depicts 
active mode of data blocks f, Q» data block £ access probabilities in session x. Similar to cache cost we 
apply max-min for computing replacing cost where R, describes minimal result of caching resource 
interchanging cost benefits and R; describes maximal result of caching resource interchanging cost benefits. 
In next section, experiment is conducted for validating cache aware resource utilization methodology over 
standard resource utilization methodology using data-intensive workload with different job-size. The ERU 
aid in achieving between energy efficiency and meet task deadline prerequisite with minimal execution time. 


4. RESULTS AND ANALYSIS 

Here, the performance of the system is tested on scientific workflow small ribonucleic acid (SRNA) 
identification protocol using high-throughput technology (SIPHT) using proposed efficient resource 
utilization model to verify high efficiency and lower energy consumption of proposed cache aware resource 
utilization model in heterogeneous computational framework. In this modern era, heterogeneous multi-core 
architectures have impressed all over across the globe in different areas such as industries, trading 
departments, and medical applications. Thus, due to extensive demand of multi-core architectures, cloud 
computing has also stated to add multi-core architecture support. Moreover, graphics processing unit (GPU) 
instances are favored in contrast to traditional CPU-based resources to improve speed and efficiency of the 
system. However, improper resource scheduling and enormous amount of energy consumption can reduce 
the performance of the model in an extensive manner. Therefore, a cache aware efficient resource utilization 
scientific workload scheduling method is introduced to ensure low energy consumption, high performance of 
the model and proper resource scheduling using heterogeneous multi-core architectures. This technique helps 
to speed up the process and performance of the model. 

Here, we have conducted various experiments using the proposed ERU model to find energy 
consumption, power sum, simulation time and average power results which are demonstrated in Table | with 
the help of SIPHT scientific dataset for various jobs 30, 60, 100 and 1000. Our proposed technique ensures 
very less energy consumption for running SIPHT scientific dataset for SIPHT 30 is 2812.991014 watts, 
SIPHT 60 is 3158.219947 watts, SIPHT 100 is 3174.261302 watts and SIPHT 1000 is 11211.22691 watts 
demonstrated in Table | which is highly reduced compared with other state-of-art techniques using similar 
statistics. Table 2 also demonstrates Execution time to finish the task using the proposed ERU technique for 
various jobs as 30, 50, 100 and 1000 with the help of SIPHT benchmark. The average power outcomes for 
SIPHT 30 is 21.99945901 W, SIPHT 60 is 21.9994593 W, SIPHT 100 is 21.9994591 W and SIPHT 1000 is 
21.99946127 W. Here, Table 2 represents average simulation time comparison of proposed ERU method 
with other state-of-art-techniques using scientific model SIPHT. Further, this section provides graphical 
representation of our simulated experiments for various jobs using SIPHT scientific dataset and compared 
outcome achieved with recent standard resource utilization methodology considering different performance 
metrics such as average power, energy consumption, simulation time and power sum. 


Table 1. Energy efficiency and execution time performance evaluation of proposed ERU model over 
existing DVFS based workload scheduling algorithm 


Parameters DVFS [10] ERU 
Sipht 30 Sipht 60 Sipht 100 Sipht 1000 Sipht 30 Sipht 60 Sipht 100 Sipht 1000 
Power Sum 10734800.6 22566020.6 33620561.4 335969269.8 97835 13.62 10291173.3 9934905.13 16022788.43 
(W) 
Average 28.6557284 28.6557203 28.65572104 28.65572239 21.99945901 21.9994593 21.9994591 21.99946127 
Power (W) 
Power 4367.658563 11228.74085 20813.04776 1070996.931 2812.991014 3158.219947 3174.261302 11211.22691 
Consumpti 
on (Wh) 
Simulation 3746.13 7874.87 11732.58 117243.34 4447.16 4677.92 4515.98 7283.26 
Time (sec) 


Figure 1 shows power sum results in contrast to DVFS-based resource utilization methodology 
using proposed ERU methodology for data-intensive workload dataset SIPHT for different job size as 30, 60, 
100 and 1000. The Figure 2 shows average power results in contrast to DVFS-based methodologies using 
proposed ERU methodology for data-intensive workload dataset SIPHT for different job size as 30, 60, 100 
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and 1000. The Figure 3 shows energy consumption results in contrast to DVFS-based methodologies using 
proposed ERU methodology for data-intensive workload dataset SIPHT for different job size as 30, 60, 100 
and 1000. 


Table 1. Computation efficiency performance evaluation of proposed ERU model over existing 
multi objective-based and DVFS-based workload scheduling algorithm 


DAGs Number of Average Simulation time (s) 
nodes DCOH [17] DVFS [10] ERU 
Sipht 30 30 178.92 124.871 148.2386667 
Sipht 60 60 194.48 131.2478333 77.96533333 
Sipht 100 100 175.55 117.3258 45.1598 
Sipht 1000 1000 179.05 117.24334 7.28326 
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Figure 1. Power sum comparison using proposed ERU model with DVFS-based workload scheduling 
algorithm 
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Figure 2. Average power consumption comparison using proposed ERU model with DVFS-based workload 
scheduling algorithm 


The Figure 4 shows execution time results in contrast to DVFS-based methodology using proposed 
ERU methodology for data-intensive workload dataset SIPHT for different job size as 30, 60, 100 and 1000. 
The outcomes achieved concludes the supremacy of ERU methodology in terms of average power, power 
consumption and power sum using SIPHT scientific dataset. Likewise, Figure 5 shows average execution 
time assessment with DVFS-based and DCOH methodology using modelled ERU methodology for data- 
intensive workload benchmark SIPHT for different job size as 30, 50, 100 and 1000. 
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Figure 3. Energy consumption comparison using proposed ERU model with DVFS-based workload 
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Figure 4. Execution time comparison using proposed ERU model with DVFS-based workload scheduling 
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Figure 5. Average execution time comparison using proposed ERU model with multi-objective based 
DVFS-based workload scheduling algorithm 
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5. CONLUSION 

Workload scheduling considering dynamic cache memory optimization under heterogeneous 
multicore environment is a challenging task. Recently, number of methodologies have aimed at bringing 
good tradeoffs among reducing energy and improving workload execution performance. An effective way of 
reducing energy dissipation is to employ DVFS technique; and for utilizing resource more efficiently and 
meet task deadline requires effective cache optimization technique. Thus, this paper presented a two phase 
cache resource optimization technique enabling V/F scaling in dynamic manner. From experiment it can be 
seen ERU improves energy efficiency by 44.29% over existing DVFS workload scheduling technique. 
Further, reduces execution time 43.215% and 61.72% over existing DVFS and DCOH workload scheduling 
technique, respectively. The proposed ERU workload scheduling model brings good tradeoffs in meeting 
task deadline with minimal execution time and energy consumption. Future work would consider evaluating 
performance of ERU considering diverse data intensive workload; and also consider employing evolutionary 
or deep learning technique to monitor and optimize QoS for executing workload. 
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